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[57] 



ABSTRACT 



An electronic spreadsheet system of the present invention 
includes a Formula Evaluator having a preferred interface 
and methods for compiling spreadsheet formulas into native 
machine language for a target processor/coprocessor. Meth- 
ods are provided for improved cell lookups, whereby a 
system memory address for a cell record may be compiled 
directly into the machine language sequence for a compiled 
formula. Additional methods are provided for simplifying 
error checking. The methods adapt NaN (IEEE format) 
values for encoding spreadsheet ERR and NA values, so that 
the on-chip logic of the coprocessor may be employed for 
correctly propagating ERR and NA values. Preferred audit- 
ing methodology is also described, which includes further 
modifying the format of NaN values so that a NaN encoding 
an ERR or NA also embeds the address of the originating 
cell (i.e., the first cell which first gave rise to the ERR or NA 
value). 

7 Claims, 18 Drawing Sheets 
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IN-MEMORY 
CELL RECORD 



pCF. 



1 



FORMULA RESULT «- 
FORMULA BYTE CODE 



CODE SEGMENT A 



HELPER FUNCTIONS 
705 



COMPILED FORMULA 



1 



COMPILED FORMULA 



COMPILED FORMULA 



730 



CODE SEGMENT B 



HELPER FUNCTIONS 
705 



COMPILED FORMULA 



X+1 



COMPILED FORMULA 



X+2 



COMPILED FORMULA , 



710 



720 



RUNTIME FORMULA EVALUATION: 

if ( pCF != 0 ) 

call pCFl; 
else 

call Pcode evaluate; 



COMPILED FORMULA 



pCELLJ 

pCELLN 

CONSTANTJ 

CONSTANTJI 

pRESULT 

START: 



JMPSTORE_AND_EXIT 



DATA 



COMPILED 
CODE 



750 



FIG. 7 
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COMPILE THE FORMULA STORED IN B1: +C3 T4/900 



IN-MEMORY LAYOUT (CODE SEGMENT): 



801 



0000 



NEAR LIBRARY (HELPER FUNCTIONS) 



PUSH_VALUE : 
STORE AND EXIT: 



810 



COMPILED FORMULA 



1 



811 



0261 
0265 
0269 
02 6B 



;026F 



DATA (CELL LOOKUPS AND STATIC DATA) 



2DDF : FB3E (Stores pointer to Cell C3) 

2DDF:FC3C (Stores pointer to Cell F4) 

0x0384 (Stores 900) 

2DDF-.342 0 (Stores pointer to Bl Formula Result) 



COMPILED CODE (MACHINE CODE SEQUENCE) 



I 



813 



MOV BX, 02 61 
CALL PUSH_VALUE 
MOV BX, 0265 
CALL PUSH_VALUE 

FMULT ST(1) , ST 

MOV BX, 0269 

FILD CS:WORD PTR[BX] 

FDIVP ST(1) , ST 

MOV BX, 026B 

JMP STORE AND EXIT 



LOAD PTR TO C3 1 s RECORD 
CALL PUSH_VALUE 
LOAD PTR TO F4 1 s RECORD 
CALL PUSH_VAIUE 

MULTIPLY [C3] * [F4] 

LOAD ADDR OF CONSTANT (900) 

LOAD 900 ON T.O.S. 

DIVIDE ([C3] * [F4]) BY 900 

LOAD PTR TO Bl FORMULA RESULT 

STORE RESULT TO Bl AND EXIT 



COMPILED FORMULA, 



COMPILED FORMULA, 



FIG. 8 
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METHODS FOR COMPILING FORMULAS 
STORED IN AN ELECTRONIC 
SPREADSHEET SYSTEM 

This is a Divisional, Utility pat. application of Ser. No. 
08/205,922 filed Mar. 3, 1994, now U.S. Pat. No. 5,471,612. 

COPYRIGHT NOTICE 

A portion of the disclosure of this patent document 
contains material which is subject to copyright protection. 
The copyright owner has no objection to the facsimile 
reproduction by anyone of the patent document or the patent 
disclosure as it appears in the Patent and Trademark Office 
patent file or records, but otherwise reserves all copyright 
rights whatsoever. 

BACKGROUND OF THE INVENTION 

The present invention relates generally to the field of 
information processing by digital computers and, more 
particularly, to the processing and presentation of informa- 
tion by application programs, particularly electronic spread- 
sheets. 

Before computers, numerical analyses, particularly finan- 
cial ones, were usually prepared on an accountant's colum- 
nar pad or spreadsheet with pencil and calculator in hand. 
By organizing data into columns and rows, spreadsheets 
afford the rapid assimilation of information by a reader. The 
task of preparing a spreadsheet on paper, however, is not 
quite so fast. Instead, the process tends to be very slow, as 
each entry must be tediously calculated and entered into the 
spreadsheet Since all calculations are the responsibility of 
the preparer, manually prepared spreadsheets are also prone 
to error. Hence, preparation of spreadsheets by hand is slow, 
tedious, and unreliable. 

With the advent of microcomputers, a solution was forth- 
coming in the form of "electronic spreadsheets." Better 
known simply as "spreadsheets,** these software programs 
provide a computerized replacement for the traditional 
financial modeling tools: the accountant's columnar pad, 
pendL and calculator. In some regards, spreadsheet pro- 
grams are to those tools what wordprocessors are to type- 
writers. Spreadsheets offer dramatic improvements in ease 
of creating, editing, and using financial models. 

A typical spreadsheet program configures the memory of 
a computer to resemble the column/row or grid format of an 
accountant's columnar pad, thus providing a visual calcu- 
lator for a user. Because this "pad" exists dynamically in the 
computer's memory, however, it differs from paper pads in 
several important ways. Locations in the electronic 
spreadsheet for example, must be communicated to the 
computer in a format which it can understand. A common 
scheme for accomplishing this is to assign a number to each 
row in a spreadsheet and a letter to each column. To 
reference a location at column A and row 1 (i.e., the 
upper-left hand corner), for example, the user types in "Al". 
In this manner, the spreadsheet defines an addressable 
storage location or "cell" at each intersection of a row with 
a column. 

Data entry into an electronic spreadsheet occurs in much 
the same manner that information would be entered on an 
accountant's pad. After a screen cursor is positioned at a 
desired location, the user can enter alphanumeric informa- 
tion. Besides holding text and numeric information, 
however, spreadsheet cells can store special instructions or 
'formulas 1 ' specifying calculations to be performed on the 
data stored in spreadsheet cells. In this fashion, cell refer- 
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ences can serve as variables in an equation, thereby allowing 
precise mathematical relationships to be defined between 
cells. 

A particular advantage of electronic spreadsheets is the 

5 ability to create a multitude of "what if' scenarios from a 
single data model This ability stems largely from the 
spreadsheet formulas, which are the means by which a user 
tells an electronic spreadsheet system how to manipulate and 
analyze one's data. After a set of mathematical relationships 

10 has been entered into a worksheet, the spread of information 
can be "recalculated" using different sets of assumptions, 
with the results of each recalculation appearing relatively 
quick. Performing this operation manually, with paper and 
pencil, would require recalculating every relationship in the 

is model with each change made. Expectedly, electronic 
spreadsheets have quickly displaced their pencil-and-paper 
counterparts for modeling user information. 

The structure and operation of a spreadsheet program, 
including spreadsheet formulas and "macros," are docu- 

20 mented in the technical, trade, and patent literature. For an 
overview, see e.g., Cobb, D.. Using 1-2-3. Que Corp., 1985; 
and Campbell, M., Quattro Pro 4.0 Handbook, 4th Ed., 
1992. The disclosures of each of the foregoing references are 
hereby incorporated by reference. 

25 Spreadsheet formulas are fundamental to the creation and 
operation of a spreadsheet data model. During creation o f a ~ 
particular spreadsheet or worksheet model, a user ent ers ' 
fo rmulas in worksheet cells the sam e way he or she enters 
values and label s. Typically, a formula begins with a number * 
or with a special character (e.g., +.-.(,©,.,#, or $) to 
distinguish the entry from raw data. More particularly, 
formulas are constructed from one or more of four basic 
components: operators, values, cell references, and 

35 ©-functions. Each of these will be briefly reviewed. 

Operators, which indicate how user data are to be com- 
bined and manipulated, typically include well-known opera- 
tors such as +.*,/, and the like. Values are the informa tion 
(e.g., numeric, logical, or text) which formulas are to a ct 

40 ur>on. To include a value in a formula, the user simply types 
it as he or she desires it to appear. 

Cell references, on the other hand, allow a user to employ 
a value in one's current formula that is derived from another 
cell. For instance, if a cell having an address of B3 contains 

4 5 the value of 10, the formula 5+B3 placed in any other cell 
of the spreadsheet will evaluate to the result of 15. A 
referenced cell containing no value (blank) or a label is. 
depending on implementation, treated as zero or as an error 
(ERR). Much of the flexibility of electronic spreadsheets 

50 stems from this ability to establish relationships via cell 
references. 

The ©-functions are built-in formulas provided for 
frequently-used calculations. Some ©-functions perform a 
calculation on a single value, such as @SQKT(n) which 

55 calculates the square root of the number n. Other 
©-functions perform calculations on cell ranges, such as 
@SUM(B3..B10) which sums the values stored in cells B3 
to B10. ©-functions can be used in formulas where one 
migjit use a value or cell reference. 

60 Before- meaningful results of a given spreadsheet may be 
presented to a user, the formulas of a given spreadsheet m ust 
fi rst be evaluated or recalculated '' Recalculation" is the 
process by which a spreadsheet's cells, particularly fo rmula- j 
s toring ones, are evaluated to express valu es. Different/ 

65 recalculation orders are well known including, for instance, 
natural, rowwise, and columnwise. Each will be reviewed in 
turn. 
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Formula 


Comment 


B3 


03*B2 


Cost of goods - - calculated as 30% 






of B2 (Revenue stored in B2) 


B4 


5000 


Advertising expense 


B5 


+ B2-B3-B4 


Pre-tax profit 


B6 


0.28 * B5 


Taxes due - - calculated as 28% of 






B5 (Pre-tax profit) 


B7 


+ B5-B6 


Net profit is B5 (pre-tax profit) 






less B6 (taxes) 



15 
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Natural recalculation entails evaluating the least de pen- SUMMARY OF THE INVENTION 

dent cellsflrst In other words, the spreadsheet system first ^ ^ . . . A 

rec»eTtho S e cells upon which the formulas and tunc- ™ e Pf"? *. ventIon f uprises an electronic spcad- 
tions in other cells depend. Suppose, for example, that one sheet *y stem a Preferred interface and methods for 
desires to create a business forecast (data model) with the 5 compiling spreadsheet formulas into native inachine instruc- 
following assumptions: (1) Cost of goods is 30% of revenue, ^2L S * ^ svstem includes a Fgrmida^valuatgxtoccjHia- ^ \ 

(2) Advertising expenses are $5,000, and (3) Taxes are 28% i ng a given spreadsheet formula into a machine lang uage V ~ 

of pre-tax profit This may be modeled in a spreadsheet s equence to b e executed b y tne target processo^copr6ces_s or. * a\ u \^S^ 
model having the following cell entries: In this manner, the system may perform recalculation of Y\ . *u-J 

spreadsheet formulas in substanti al ly less time than is ^ (jl 
1 required by prior art, token-based interpreter systems. 

In an exemplary embodiment, the system of the present 
invention comprises the Formula Evaluator operating in 
conjunction with a stack-oriented floating-point unit (FPU) 
or "math coprocessor" which is compliant with the ihkk 
Floating-point standard (IEEE-754, Standard for Binary 
Floating-point Arithmetic). Besides evaluating expressions 
in a Reverse-Polish Notation (RPN) or postfix format, such - 
coprocessors include internal formats for representing posi- 
20 tive and negative infinity, and NaNs (Not a Number). These 
Now, the user may quickly forecast a net profit (value intemal representations are adapted by the Formula Evalu- 
expressed in cell B7) for a given revenue value (entered into «*f / or representing standard spreadsheet ERR (error) and 
cell B2). Amultitude of revenue values may be forecasted by NA ( not available ) values. 

simply entering different values into B2. To ma ^ £t compilation of formulas feasible, the system 

Interdependencies between the cells dictate how a spread- 25 includes improved methodology for cell record lookups (Le., 
sheet system actually performs recalculation. Before a value resolving cell addresses for retrieving cell values). Since a 
for cell B7 may be calculated, for instance, the value for cell math coprocessor loads operands from and stores results to 
B5 (and for cell Bd) must be known. Before B5 can be system memory, machine language for the coprocessor must 
calculated, a value for cell B3 (and cells B2 and B4) must include actual system memory addresses (not cell 
be known. Cell B2 must be evaluated before a value for B3 30 identifiers). The present invention recognizes that at runtime 
can be expressed. Because natural recalculation recognizes allocated memory blocks for cell information typically are 
these interdependencies, it is a recalculation method com- ^ Ci ^ (i- e -» wm * not De moved to a new location). This aspect 
monly employed in conventional electronic spreadsheets. is exploited to improve cell lookups as follows. Formulas are 

Rowwise recalculation, in contrast, moves across each compiled to native machine code on the first recalculation 
row from the left-most cell to the right-most cell, beginning 35 operation of a spreadsheet typically when a spreadsheet is 
with the top-most row in the model and working to the firs* loaded. Since the formulas are compiled after the cell 
bottom-most. Columnwise recalculation moves from the records have been loaded into system memory, the various 
top-most cell to the bottom-most cell in a column, starting cc ^ addresses in formulas may be resolved to actual 
with the left-most column and moving to the right-most addresses in system memory for the user session. In this 
Since the rowwise and columnwise recalculation methods 40 manner, the actual address for a cell (i.e., system memory 
typically do not recognize the interdependencies between address for the cell result stored in the in-memory cell 
cell values, multiple recalculation passes may be required to record) may be compiled directly into the machine language 
produce a recalculated spreadsheet sequence for the spreadsheet formula. 

Regardless of which recalculation order is employed, the The Formula Evaluator of the present invention also 
quickness or "speed" with which recalculation can be com- 45 includes improved error checking facilities, thereby elirni- 
pleted is often used as an indicator of the overall perfor- nating the need for incorporating numerous checks in com- 
mance of an electronic spreadsheet A slow recalculation piled formulas. Specifically, operation of the coprocessor is 
time for a spreadsheet system ensures that the user will adapted to provide native support for spreadsheet ERR and 
waste time waiting for the system to complete its recalcu- NA values, thus ensuring their proper propagation, 
lation. Quick recalculation, on the other hand, allows the 50 Specifically, NaN (IEEE format) values are employed for 
user to complete his or her task with minimum interruption. representing spreadsheet ERR and NA values. Since adding 
Accordingly, electronic spreadsheets which perform recal- a NaN (e.g., one representing an ERR or NA value) to a 
culations quickly enjoy greater commercial success. non-NaN yields a NaN with an unchanged mantissa bit 

With the present trend of employing electronic spread- pattern (Le., preserving it as an ERR or NA), the math 
sheets to model increasingly complex data models, the 55 coprocessor's rules for propagating NaNs may be employed 
ability of present-day electronic spreadsheet systems to for correctly propagating spreadsheet ERR and NA values, 
quickly evaluate models has become taxed. For instance, it Moreover, by defining ERR to have a larger mantissa bit 
is not uncommon for a financial model of even modest pattern than NA, the on-chip logic of the coprocessor may 
complexity to have a recalculation time on the order of be employed to correctly propagate ERR over NA (thus 
several minutes. While improved computer hardware, 60 satisfying standard spreadsheet rules for propagating ERR 
including more efficient floating-point processors and faster and NA values). 

clock speeds, has helped, many users of electronic spread- In the system of the present invention. NaNs are further 
sheets still spend a significant portion of their time waiting modified to encode cell addresses, for tracking first instances 
idly while their spreadsheets are being recalculated. of ERR and NA values. Specifically, particular bits of the 
Accordingly, there is much interest in improving the speed 65 mantissa are used to encode a unique cell identifier — one 
with which electronic spreadsheets perform recalculation of which is directly translatable into a particular cell address. In 
spreadsheets. an exemplary embodiment 48 bits are employed for encod- 



06/14/2004, EAST version: 1.4.1 



5,6: 

5 

ing a notebook, page, row, and column for a cell. Thus as the 
NaN (representing ER or NA) propagates, the address of the 
originating cell also propagates to all destination cells. 

The present invention also includes a preferred interface 
having an audit mode whereby the ERR or NA-propagated 
address is displayed at all destination cells. In this manner, 
the system allows the user to immediately view a source of 
error. As a result the user no longer needs to expend effort 
tracing through a multitude of formulas for tracking down 
the source of an ERR or NA value. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1A is a block diagram of a computer system in which 
the present invention may be embodied. 

FIG. IB is a block diagram of a software system of the 
present invention, which includes operating system, appli- 
cation software, and user interface components. 

FIG. 1C is a block diagram of an electronic spreadsheet 
system the present invention. 

FIG. 2A is a screen bitmap illustrating a spreadsheet 
notebook interface employed in a preferred embodiment of 
the present invention; a sample "spreadsheet" has been 
loaded. 

FIG. 2B is a bitmap of a toolbar component of the 
interface of FIG. 2A 

FIG. 2C-D are bitmaps illustrating page identifiers for 
rapidly accessing and manipulating individual pages of the 
notebook interface. 

FIG. 3A is a block diagram illustrating the IRRR format 
for representing floating-point numbers in a computer. 

FIG. 3B a block diagram illustrating register layout for a 
stack-based math coprocessor, such as found in the Intel 
80x87 line of coprocessors. 

FIG. 3C is a block diagram illustrating operation of a 
stack-based coprocessor for evaluating numeric expressions. 

FIG. 4A is screen bitmap illustrating occurrence of a 
spreadsheet NA ("not available*') value in the spreadsheet of 
FIG.2A. 

FIG. 4B is a screen bitmap illustrating occurrence of a 
spreadsheet ERR ("error") value in the spreadsheet of FIG. 
2A. 

FIG. 4C is a block diagram illustrating a method of the 
present invention for encoding spreadsheet NA and ERR 
values (and originating cell address) an NaN (Not a 
Number) values in TP&P. 80-bit TReal format and IRRR 
64-bit double format 

FIG. 4D is a block diagram illustrating register processing 
of an NA-pncoded NaN in a stack-oriented math coproces- 
sor. 

FIG. 4E is a block diagram illustrating register processing 
of an ERR-encoded NaN in a stack-oriented math copro- 
cessor. 

FIGS. 5A-B are block diagrams illustrating register pro- 
cessing of an ERR-encoded NaN together with an 
NA-encoded NaN in a stack-oriented math coprocessor. 

FIGS. 6A-B are screen bitmaps illustrating an interface 
for dynamically auditing spreadsheets. 

FIG. 7 is a block diagram illustrating the in-memory 
relation between cell records and compiled code formulas 
(which are stored in one or mare code segments). 

FIG. 8 is a block diagram illustrating in-memory layout 
for a formula compiled in accordance with the present 
invention. 
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DETAILED DESCRIPTION OF A PREFERRED 
EMBODIMENT 

The following description will focus on the presently 
preferred embodiments of the present invention, which are 

5 embodied in an electronic spreadsheet system operative in 
an Intel 80x86/80x87-comparible architecture. The present 
invention, however, is not limited to any particular environ- 
ment or any particular application. Instead, those skilled in 
the art will find that the teachings of the present invention 

io may be advantageously applied to a variety of different 
architectures, including Motorola 68000 series. PowerPC, 
Mips, and the like. Therefore, the description of the exem- 
plary embodiments which follows is for purposes of illus- 
tration and not limitation. 

15 General System 
A. Hardware 

As shown in FIG. 1A. the present invention may be 
embodied in a computer system such as the system 100, 
which includes a central processing unit (CPU) 101. a math 
coprocessor or floating-point processing unit (FPU) 115, a 

20 main memory 102, an input/output controller 103, a key- 
board 104, a pointing device 105 (e.g., mouse, trackball, pen 
device, or the like), a display device 106, and a mass storage 
107 (e.g., hard disk). Additional input/output devices, such 
as a printing device 108, may be included in the system 100 

25 as desired. As illustrated, the various components of the 
system 100 communicate through a system bus 110 or 
similar architecture. 

In a preferred embodiment, the computer system 100 
includes an IBM-compatible personal computer, which is 

30 available from several vendors (including IBM of Armonk, 
N.Y.). More particularly, the CPU 101 comprises an Mei 
80x86-class microprocessor (e.g., Intel 80386), and the FPU 
115 comprises a corresponding Intel 80x87-class math 
coprocessor (e.g., Intel 80387); both are available from Intel 

35 Corporation of Santa Clara, Calif. Compatible microproces- 
sors and coprocessors are available from a variety of 
vendors, including Advanced Micro Devices (Austin, Tex.) 
and Cyrix (Richardson, Tex.). Preferably, the CPU 101 
includes an on-chip FPU (as shown in FIG. 1A), as is 

40 available with an Intel 80486DX-class microprocessor (or 
later, such as Pentium® class). 
B. Software system 
1. Overview 

Illustrated in FIG. IB, a computer software system 120 is 

45 provided for directing the operation of the computer system 
100. Software system 120, which is stored in system 
memory 102 and on disk memory 107, includes a kernel or 
operating system 140 and an interface 160. One or more 
application programs, such as application software 125, may 

so be 'loaded** (i.e., transferred from storage 107 into memory 
102) for execution by the system 100. The system 100 
receives user commands and data through user interface 
160; these inputs may then be acted upon by the system 100 
in accordance with instructions from operating system mod- 

55 ule 140 and/or application module 125. The interface 160, 
which may support a character-based and/or preferably a 
graphical user interface (GUI), also serves to display results, 
whereupon the user may supply additional inputs or termi- 
nate the session. In a preferred embodiment, operating 

60 system 140 comprises MS-DOS® operating in conjunction 
with Microsoft® Windows; both are available from 
Microsoft of Redmond, Wash. Interface 160. on the other 
hand, is typically provided by the application programs and 
spreadsheet system 130, which will now be described. 

65 2. Electronic Spreadsheet 

As shown, system 120 also includes a spreadsheet system 
130 of the present invention. The spreadsheet system 130 
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interfaces with the system 100 through the operating system 
140. Spreadsheet system 130 may also be operably coupled 
to one or more spreadsheet add-in libraries 150, which are 
commercially available. 

The spreadsheet system 130 of the present invention is 
shown in further detail by a simplified block diagram of FIG. 
1C. The system 130 includes a spreadsheet engine or module 
135 and a graphing module 13S. In operation, the system 
130 maintains a matrix or "spread" of information cells 133, 
each cell being an atomic unit maintained and processed by 
the module 135. Data and formulas for the cells are receiv ed 
from an input device ( e.g„ Keyboard, mouse, disk and the 
like) for storage and processing by the system 130. From the 
cells, larger groupings, such as ranges (including blocks), 
sheets (including 2-D spreads), and groups of sheets 
(including 3-D spreads, linked sheets, and pages of sheets or 
"notebooks'*) may be defined. The graphing module 138 
provides the system 130 with the capability of displaying the 
spread of information cells visually, such as in various 
graphs. 

As shown, the spreadsheet engine 135 includes a Formula 
Evaluator 136 of the present invention. The Formula Evalu- 
ate* 136 processes the various formulas stored in a spread- 
sheet during spreadsheet recalculation. Of particular interest 
are methods of the Formula Evaluator for compiling spread- 
sheet formulas directly into machine code (for execution by 
the FPU 115 operating in conjunction with CPU 101). In this 
manner, the spreadsheet subsystem 130 greatly increases the 
efficiency with which spreadsheet recalculation may be 
performed by the system 100. Operation of the Formula 
Evaluator 136 is described in greater detail hereinbelow. 

In a preferred embodiment, the spreadsheet system 130 
includes Quattro® Pro for Windows, available from Borland 
International of Scotts Valley, Calif. A description of the 
general features and user operation of system 130 is avail- 
able in the documentation which accompanies Quattro® Pro 
for Windows. 
3. Notebook interface 

The spreadsheet system 130 includes a preferred note- 
book interface 200, shown in FIG. 2A, for receiving, 
(processing, and presenting information, including alphanu- 
hneric as well as graphic information. Notebook workspace 
1200 includes a menu bar 210, a toolbar 220, a current cell 
indicator 230, an input line 231, a status line 240. and a 
^notebook workspace 250. The menu bar 210 displays and 
invokes, in response to user inputs, a main level of user 
commands. Menu 210 also invokes additional pulldown 
submenus, as is known in windowing applications. Input 
line 231 accepts user commands and information for the 
entry and editing of cell contents, which may include data, 
formulas, macros, and the like. Indicator 230 displays an 
address for the current cursor (Le., active cell) position. At 
the status line 240, system 100 displays information about 
the current state of the workbook; for example* a 'HEADY" 
indicator means that the system is ready for the user to select 
another task to be performed. 

The toolbar 220, shown in further detail in FIG. 2B. 
comprises a row or palette of tools which provide a quick 
way for the user to choose commonly-used menu commands 
or properties. In an exemplary embodiment, toolbar 220 
includes cut, copy, and paste buttons 221, a power button 
tool 222, a graph tool 223, alignment buttons 224, font 
buttons 225, a style list 226 with pulldown, insert/delete and 
fit buttons 227, action (e.g., sort, sum, and the like) buttons 
228, and an "expert" (tutor) button 229. The functions of 
these buttons are suggested by their names. For instance, 
buttons 221 cut, copy, and paste data and objects to and from 
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Windows' clipboard. Tool 222 creates "powerbuttons" 
which allow a user to run spreadsheet macros; in a specific 
embodiment, powerbuttons appear as floating objects in a 
layer above spreadsheet cells. In a similar fashion, the graph 
5 tool 223 creates floating graphs that appear above spread- 
sheet cells. 

The notebook, which provides an interface for entering 
and displaying information of interest includes a plurality of 
spreadsheet pages. Each page, such as page A shown in 

10 notebook workspace 250, may include conventional win- 
dowing features and operations, such as moving, resizing, 
and deleting. In a preferred embodiment, the notebook 
includes 256 spreadsheet pages with one Graphs page, all of 
which are saved as a single disk file on the mass storage 107. 

15 Workspace 200 may display one or more notebooks, each 
sized and positioned (e.g., tiled, overlapping, and the like) 
according to user-specified constraints. 

Each spreadsheet page, in turn, includes a 2-D matrix of 
spreadsheet cells, arranged as a rectangular grid comprising 

20 columns and rows. At the intersection of a column with a 
row is a single celL A cell is a location where the user can 
enter data, formulas, and the like. Each cell is identified by 
an address determined by the column and row that contain 
it; in other words, the letter and number that make up the 

25 address are the cell's coordinates. Each column of cells is 
identified by a letter. Columns are marked with letters A-Z, 
then AA-AZ, BA-BZ and the like, up to IA-IV in a 
preferred embodiment Each row of cells is identified by a 
number. Rows are marked with numbers ranging from 1 to 

30 8 192, in a preferred embodiment Thus, for example, the cell 
in the top left corner is cell Al. 

A highlighted rectangle or "cell selector" is provided for 
indicating a currently active one (i.e., the cell that is cur- 
rently selected). This is the cell that will be affected by the 

35 next action, such as entering a value. The column containing 
the cell selector appears on the border in reverse video (ox 
in a contrasting color on a color screen); the row containing 
he cell selector appears in reverse video or a contrasting 
color, depending on the monitor. The user can move the 

40 selector to any cell in the spreadsheet 

To enter information in the spreadsheet, the user typically 
moves the selector to a desired cell (e.g., with a screen cursor 
185 or keyboard 102) and types in the entry. The characters 
typed in appear on the input line. When the entry is 

45 complete, the user presses Enter or an arrow key (Right 
arrow. Left arrow, PgUp, or the like). The system 130 writes 
the value into the current celt erasing any previous entry. If 
the user enters the data by pressing an arrow key, the selector 
moves in the appropriate direction. 

50 Multiple cells may be selected for aggregate operations. 
In particular, a cell "block" is defined as a rectangular group 
of cells identified by the cell addresses of two opposite 
corners— usually the upper left and bottom right cells. In 
response to user movement signals from the pointing device 

55 105. the cursor may be "dragged" across a range of cells for 
selecting a block, as is known in the art. Blocks may include 
contiguous or adjacent cells or ones which are non- 
contiguous. Once selected, blocks may be used in com- 
mands and formulas to act on several cells at once. 

60 Since a notebook may include a plurality of 2-D spread- 
sheets or pages, the notebook interface includes an identifier 
for each page. As shown in FIGS. 2C-D, individual note- 
book pages are identified by page identifiers 260, preferably 
located along one edge of a notebook. In a preferred 

65 embodiment, each page identifier is in the form of a tab 
member (e.g., members 261a, 262a, 263a) situated along a 
bottom edge of the notebook. Each tab member may include 
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representative indicia, such as textual or graphic labels, The architecture of the Intel math coprocessors is. to a 
including user-selected titles representing the contents of a large extent, dictated by the manner in which floating-point 
corresponding page. In FIG. 2C, the tab members 260 are set or "real" numbers may be represented in a digital computer, 
to their respective default names. For example, the first three Basically, real numbers are represented internally in a binary 
tab members (members 261a, 262a, 263a) are respectively 5 floating-point format comprising sign, exponent, and man- 
set to A, B, and C. Tab members are typically given tissa (fractional part) bit fields, as shown in FIG. 3A. Today, 
descriptive names provided by the user, however. As shown mos t floating-point processors support the popular tkrr 
in FIG. 2D, for example, the first three tab members have floating-point standard (IEEE-754, Standard for Binary 
nowbeensetto"Contente M ^ Floating-point Arithmetic). The IEEE standard defines a 
(tab member 262b), and Jan (tab member 263b). In a specific format (actual bit pattern) used to represent floating- 
simnar manner, the remaining tabs are set to subsequent * mt numbers Sin ^ e F / cisio / (32 _ 5ils Cg). double pre- 
months of the year. In this manner, the user associates the . . ... , *r . . , v , , on , ^ 
page identifiers with familiar tabs from an ordinary paper f ^n (64-bits long), and extended precision (80-bits long) 
notebook. Thus, the user already knows how to select a page tGI ^ are denned. 

or spread of interest: he or she simply selects the tab hternally, the Intel coprocessors use the extended or 

corresponding to the page (as one would do when selecting 15 ^mporary real" format for all calculations; all data types 

a page from a paper notebook). read froni memory are converted to the temporary real 

A general description of the features and operation of the format automatically. Accordingly, specific register archi- 

spreadsheet notebook interface may be found in Quattro® tecture of the Intel coprocessors is designed to accommodate 

Pro for Windows (Getting Started, User's Guide and Build- the 80-bit temporary real format As shown in FIG. 3B, the 

ing Spreadsheet Applications), available from Borland Inter- 20 Intel coprocessors include eight 80-bit numeric registers, 

national; additional description may be found in U.S. pat with each register being functionally divided to include a 

application Ser. No. 07/866,658, commonly owned by the 1-bit sign field, a 15-bit exponent field, and a 64-bit mantissa 

present assignee. field as shown. 

Numeric processing Although the registers may be accessed individually, most 

A. Floating-point processors 25 calculations are accomplished using a stack-oriented 

Before describing the Formula Evaluator 136 in detail, it approach: a program "pushes** values onto the stack from 

is helpful to review the architecture and operation of system memory, performs an operation on the values (e.g., 

floating-point processors or "math coprocessors." In this addition, subtraction, multiplication, division, and the like), 

manner, adaptation of the operation of math coprocessors for and "pops" the result from the stack back to system memory, 

processing compiled spreadsheet formulas may be under- 30 To push values from memory onto me stack, me coprocessor 

stood. Although a variety of floating-point microprocessors includes an FLD family of instructions. Operands are usu- 

are available, the following discussion will focus on Intel's ally pushed onto the Top of the Stack (TOS). Additional 

80x86780x87 family of rnicroprocessors and coprocessor. instructions are provided for arithmetic operations, such as 

Those skilled in the art will appreciate that the teachings of FADD for adding two floating-point numbers. An FST 

the present invention may be adapted for use with non-Intel 35 family of instructions are provided for copying a value at the 

architectures as well. top of the stack into another coprocessor register or system 

CPU microprocessors of the Intel 80x86 family are fun- memory, 

damentally integer processing units — designed to operate on B. Illustration of Stack-based operation 

a variety of integer data types. Integer processing units are Referring now to FIG. 3C, operation of stack-based 

ones which offer no direct support for operations on floating- 40 coprocessors, such as the Intel coprocessors, will be illus- 

point data type. Given the correct sequence of instructions, trated. Consider a simple calculation, C=AX+B, where A, B, 

an integer processing unit can of course perform operations C and X reference floating-point values stored in system 

on any numeric data type, including floating-point ones. memory. For stack-oriented evaluation, operand tokens of an 

However, this "software emulation** of floating-point is not expression are first loaded, then an operator is applied. This 

particularly fast and in fact can be quite slow. Floating-point 45 format commonly dubbed "Reverse-Polish Notation" 

microprocessors, in contrast, include intrinsic support for (RPN) or "postfix" format, is well known (and perhaps best 

floating-point data types and. thus, can substantially reduce known from its use in a popular line of pocket calculators), 

the amount of time required to perform floating-point opera- Evaluation of the above expression may be accomplished 

tions. with the following machine language instructions (for 80x87 

Each Intel 80x86 microprocessor has a corresponding 50 coprocessors): 
(optional) floating-point unit or "math coprocessor" with 

which it is compatible. The Intel 80286 microprocessor, for — — 

instance, may operate in conjunction with an Intel 80287 * 

math coprocessor. Similarly, an Intel 80386DX micropro- fmul sr,ST(i) 

cessor can function in tandem with an Intel 80387DX math 55 FLD b ' 

coprocessor (or even an 80287 math coprocessor). The fadd ST,ST(i) 



fstp 



(1) push a onto stack 

(2) push x onto stack 

(3) multiply a by x, result stored in ST 

(4) push b onto stack 

(5) add b to (a • x) 

(6) store result in c, pop from stack 



processor and coprocessor share a cooperative relationship. 

While the processor fetches and decodes instructions and 

calculates memory addresses, the coprocessor performs The first instruction, FLD a. tells the coprocessor to push the 

arithmetic operations on floating-point numbers. The 60 value stored in the memory location a onto the top of the 

instruction set of the coprocessor largely compliments that stack (i.e., into the first register, ST). The second instruction, 

of the processor, allowing floating-point operations to be FLD x, pushes the value of x onto the stack, thereby pushing 

integrated into an application program with rninimum dif- the value of a down the stack (i.e., into the second register), 

ficulty. Beginning with the 80486DX line of With two operands stored on the stack, the operator may 

microprocessors, Intel incorporated both a processor and a 65 now be applied. For the desired operation at this point (Le., 

coprocessor on the same chip, thus eliminating the need for multiplication), the instruction FMUL ST, ST(1) is 

a separate floating-point unit employed to instruct the coprocessor to multiply the values 
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stared in the top two registers, ST and ST(1); the result of the 
operation is stored on the top of the stack (Le., in ST). The 
stack now contains the values a * x (in ST) and a (in ST(1)). 

The value of b is now pushed onto the stack by the 
instruction ELD b. The value of b may now be added to the 5 
product of a * x by issuing the FADD instruction, which 
adds the value stored in the ST register to that stored in the 
ST(1) register. The result of the operation is again stored 
back into ST. The stack now stores in its registers values for 
ax+b, ax, and a. The value of ax+b, which is at the top of the 10 
stack, is the desired result of the calculation. It may be 
popped off the stack and stored into a memory location 
(denoted by c) by issuing the FSTP instruction. 
C Handling Coprocessor Exceptions 

Since the floating-point formats support a limited range of 
numbers the possibility exists that a result of a calculation 
may be outside of the range. Consequently, floating-point 
hardware must be able to detect these conditions. The TRRR 
standard specifies that an out-of-range value or "overflow" 
be replaced by one of two special values, +«> or -°o. These 
infinite values are represented by setting the exponent bits 
high (all 1 bits) and the mantissa bits low (all 0 bits). In this 
fashion, when infinite values are encountered in subsequent 
operations, they may be handled in a predictable fashion, 
such as -HxH-1.0 =», or 1/(-k»)=0. 

In addition to the possibility of overflow, there exists the 
possibility of calculations where no meaningful result can be 
determined. Examples include dividing 0 by 0, or subtract- 
ing -H» from itself. As in the case of overflows, the trrp. 
standard defines a special value (in addition to specifying 
that the error may be trapped, interrupting the calculation). 
The special value which indicates that no meaningful result 
is possible is called a **NaN" (Not a Number). NaNs are 
represented by a bit pattern having a maximum exponent (all 
exponent bits set to 1) and a non-zero mantissa (at least one 
mantissa bit set to 1). 

The IEEE standard specifies two types of NaNs: signal- 
ling and non-signalling NaNs. Signalling NaNs are typically 
trapped as errors. If either of the operands of an arithmetic 
operation is a non-signalling or "quiet" NaN, on the other 
hand, the result is simply another NaN; no error is trapped. 
In this manner, NaNs may be silently propagated to marV 
subsequent results as possibly unreliable; this propagation 
does not interrupt the sequence of operations, however. 
Moreover, the mantissa bits in the NaN format may be 
employed to encode an error identification, for indicating the 
reason for the generation of the NaN. 

To track these and other error conditions or "exceptions " 
including stack overflow and underflow, the Intel coproces- 
sor maintain a 16-bit "Status Word" register. The Status 
Word register, which is divided into fourteen fields, reports 
the current state of the coprocessor. The following table 
illustrates the layout of the status bits. 



TABLE 1 



Bits 



Purpose 



0 Exception Flag: Invalid Operation 

1 Exception Flag: Denormal Operation 

2 Exception Flag: Zero Divide 

3 Exception Flag: Overflow 

4 Exception Flag: Underflow 

5 Exception Flag: Loss of Precision 

6 Stack Flag (undefined on 8087 & 287) 

7 Error Summary (Interrupt Request on 8087) 

8 Condition Code 0 

9 Condition Code 1 
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TABLE 1 -continued 



Bits 



Purpose 



10 

11-13 

14 

15 



Condition Code 2 
Stack Top Pointer 
Condition Code 3 
Busy Flag 



15 



20 



25 



30 



35 



40 



45 



50 



When an error occurs in a calculation, the coprocessor sets 
one or more of the exception flag bits in the Status Word 
Register. 

A Control Word register is also provided for specifying 
operational options, under programming control. It includes 
the following bit layout 



55 



60 



65 



TABLE 2 



Bits 


Purpose 


0 


Exception Mask: Invalid Operation 


1 


Exception Mask: Denormal Operation 


2 


Exception Mask: Zero Divide 


3 


Exception Mask: Overflow 


4 


Exception Mask: Underflow 


5 


Exception Mask: Loss of Precision 


6 


Reserved 


7 


Interrupt Enable (8087 only) 


8-9 


Precision Control 


10-11 


Rounding Control 


12 


Infinity Control 


13-15 


Reserved 



By setting or clearing these bit fields, the programmer can 
exercise control over the operation of certain aspects of the 
coprocessor, including masking certain exceptions and 
errors. 

The architecture and operation of Intel coprocessors are 
documented in the technical, trade, and patent literature. 
Data sheets and reference manuals are available from Intel 
Corp. of Santa Clara, Calif. For additional description of the 
operation of the '387 coprocessor, see: 387DX Math Copro- 
cessor Data Sheet (Part No. 240448) and 387DX User's 
Manual, Programmer's Reference (Part No. 231917). For 
operation of the *486's FPU, see: i486 Microprocessor Data 
Sheet (Part No. 240440); i486 Microprocessor Program- 
mer's Reference Manual (Part No. 240486); and i486 Micro- 
processor Hardware Reference Manual (Part No. 240552). 
The disclosures of each of the foregoing are hereby incor- 
porated by reference. 

Formula Evaluation in an Electronic Spreadsheet 

With the reader having a basic understanding of the 
operation of a floating-point unit, methods of the present 
invention for evaluating and compiling formulas in an 
electronic spreadsheet may now be described. 
A. Cell Records and Cell Addresses 

A spreadsheet comprises an array of cells — storage loca- 
tions which may be represented in memory and on disk. 
Every cell used in a spreadsheet has a cell record associated 
with it. A cell record typically represents a cell value and 
attributes, for a particular cell location. An integer stored in 
a cell, for instance 5 stored in cell Al, may be represented 
by an integer cell record: 



Integer Oil Record 

rec_type = RHJhtegcrCell = 13 

rcc kn = 8 

rec__body 
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-continued 



BYTE 
BYTE 
WORD 



unsigned 
int 



column; 

page; 

row; 

unused : 3; 
attr_Jd : 13; 



// attribute id 
II value 



10 



14 



in a cell attribute lookup table, the cell record proper storing 
a terse attribute identifier (attr_Jd) referencing the appro- 
priate table entry which describes this cell (and possibly 
others). 

Formulas, which are stored in system memory at runtime 
as Formula Cell Information records, are represented inter- 
nally as opcodes or "p-code/* in familiar Reverse-Polish 
Notation (RPN) form. The opcodes may be converted back 
into text strings ("de-parsed") for easy on-screen editing of 
formulas by the user. In an exemplary einbodiment, a 
Formula Cell Information Object is created in system 
memory, the object being instantiated from the following 
C++ class: 



class_CEXPORT_JFmlCellInfo : public SSCeUInfo 
{ 



double 


value; 


// formula's values 


SSCellRef 


loc; 


// formula's location 


FmlCelllnfo 


♦next; 


// the next formula cell in the chain 


FmlCelUnfo 


*prev; 


// the prev formula cell in the chain 


void (*feval) ( ); 




// ptr to compiler formula (pCF) 


WORD 


state; 


// formula's state 


WORD 


wLen; 


// length of formula 


WORD 


wRefPos; 


// starting position of references 


char 


formula [ ] ; 


// formula bytecodes of variable length 



}; 



A floating-point cell may be represented by: 



Floating-pt Cell Record 

rec_type = KT_FloatCell = 14 

rec_len = 14 

rec_body 

BYTE column; 

BYTE page; 

WORD row; 

unsigned unused : 3; 

unsigned attr_Jd : 13; // attribute id 

double x; // value 



A "blank" (empty value) cell may be represented by: 



Blank Cell Record 

rec_type = KT_JBlankCell = 12 



rec_Ien = 6 
rec_Jxxfy 

BYTE column; 

BYTE page; 

WORD row; 

unsigned unused : 3; 

unsigned attr_id : 13; // attribute id 



Text labels, on the other hand, may be represented by: 



Label Cell Record 

«c_type = RT_JLabelCell = 15 
rcc_len = 7 + size of label + 1 



rec_body 

BYTE column; 

BYTE page; 

WORD row; 

unsigned unused : 3; 

unsigned attr_Jd : 13; // attribute id 

char prefix; // label prefix 

char label [ J ; 



Thus, as shown, location for the cell is stored by the 65 
column and row coordinates (the page may be deduced from 
the Page ID). Actual attributes for the cell may be specified 



As shown, an instantiated Formula Cell Information Object 
includes a value member and a formula[]array. The byte- 
code formula is stored in formula[]. The calculated result for 
the cell, on the other hand, is stored as value. The object also 
stores a pointer-to-compiled formula. (*feval) (). for the cell. 
If no compiled formula is available, the pointer is set to 
NULL. 

The cell address associated with each cell record corre- 
sponds with or is translatable into an on-screen location for 
the cell of the record. As shown by the cell records above, 
cell page, row, and column location are represented by single 
or multi-byte identifiers. In an exemplary embodiment, each 
cell reference resolves to a 6-byte identifier logically com- 
posed of: 

10 bits notebook ID 

6 bits reserved 

8 bits sheet ID 

8 bits column # 

3 bits reserved 

13 bits row # 

For instance. Column B is represented by the number 1, and 
row 3 is stored as 2 (there is no row 0). So a typical cell 
reference, like C7 (on the first page with only one notebook 
active), would be represented by a six-byte identifier of 
notebook 0 1 sheet 0 1 column 2 i row 6. Particular bits in the 
6-byte identifier may be employed for indicating absolute 
and relative addressing, as is known in the art The relative 
and absolute bits (which are not of interest here) are simply 
treated as reserved. 

Literature describing cell record and/or opcode formats 
for Lotus 1-2-3®, Microsoft Excel®, and Borland Quattro® 
Pro are available from the respective vendors. Additional 
description for Quattro® Pro may be found in U.S. pat 
application Ser. No. 08/029.807, commonly owned by the 
present assignee. 

B. Stack-based Recalculation of Formulas 

In operation, a spreadsheet formula evaluator. such as 
Evaluator 136. may employ a stack-based mechanism for 
evaluating formulas, similar in operation to that previously 



35 



50 



55 
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described for the coprocessor. The actual order employed for 2. Cell Addresses in Compiled Formulas 

recalculating a multitude of formulas may be any one of the To make compilation of formulas feasible, improved 

well-known methodologies (previously described), includ- methodology for looking up values in cells (Le., resolving 

ing natural, columnwise, rowwise, or the like. cell addresses) is required. As previously described, a math 

Given the expression A1+2*A2, an evaluator starts by 5 coprocessor loads operands from and stores results to system 

reading the cell reference of Al (more specifically, the memory; thus machine language for the coprocessor must 

six-byte cell identifier for Al), determining the memory include valid system memory addresses (not cell identifiers), 

location for the cell, fetching the value stored at that memory Moreover, as memory for cells is allocated in blocks or 

location, and pushing it onto a stack (e.g., coprocessor chunks (since it is generally not desirable to allocate system 

stack). The next token, 2, is read and its value is pushed onto ]Q memory for all cells present in a spreadsheet), it is not 

the stack; the coprocessor is then instructed to evaluate the feasible to predict beforehand where a given cell will be 

operands on the stack (an addition operation for this stored in memory (i.e., what allocated memory block, if any, 

example). The process may continue to operate in this contains the cell). 

manner until all operands have been evaluated, thereby The present invention recognizes that at runtime allocated 

leading to an end result (on the top of the stack). memory blocks for cell information are typically fixed (Le., 

Evaluation of spreadsheet formulas using a stack-based 15 not likely to move to new locations). This aspect may be 

token interpreter has been described in the technical, trade, exploited to improve cell lookups, making compiled formu- 

and patent literature. A typical implementation is described las practical. In a preferred embodiment, formulas are com- 

in FORMULA PROCESSOR, International Application No. piled to machine code upon the first recalculation operation, 

92307280.5, International Publication No. 0 527 622 A2, typically when a spreadsheet is first loaded (Le., when cell 

Feb. 17, 1993, which is hereby incorporated by reference. 20 records for the spreadsheet are loaded from disk into system 

Since opcode tokens are interpreted at runtime with each memory). Since the formulas are compiled after the cell 

recalculation operation, however, systems employing such records have been loaded into system memory, the various 

an approach lack performance benefits attendant with com- cell addresses in formulas may be resolved to actual 

piled formulas. addresses in system memory. For the cell Al, from the 

C. Spreadsheet Formula Compilation 25 formu i a m me example above, the system may determine a 

1. General particular memory address where storage for the cell has 

According ,to the present mention, operation of both the been ^^ed. This address for the cell may then be 

floatm g; point unit 115 and the Formula Evaluator 136 are ned ^to ^ machine ^ sequence for 

modified so that spreadsheet formulas may be compiled into ^7^^^^^. S g q 

native machine code for a target processor and coprocessor ^ c .*\ . WDm _T^ ™ *u » ♦ * A , 

(e.g., Intel 80x87/80x86). Although spreadsheet formulas 30 Smce *" memor y a <^ sses for cells are not fixed from 

may be compiled entirely to non-floating point machine session t0 session ' machme code for compiled formulas is 

code for execution by an integer processing unit, the best not saved to ^ k -' m a Furred embodiment. Specifically, as 

performance gains are realized when the formulas are com- * e mem0I 7 addresses for cell references are coded directly 

piled to floating point machine code for execution by a ^to compiled formulas and those addresses can only be 

system having a math coprocessor. 35 determined upon loading the spreadsheet into memory, it is 

At the outset, it should be appreciated that cell formulas not "seful to store compiled formulas from one session to 

cannot simply be "compiled" directly into native machine another, as the memory addresses in those compiled formu- 

code far a target processor. Consider, for instance, the las ^ change. The machine code is instead gener- 

evaluation of a simple formula: A1+B3. In simplistic terms, ated on-the-fly, at the first recalculation opportunity, with all 

one may mink of this as evaluation of the following expres- 40 subsequent recalculation operations employing the 

sion (in RPN form): Al B3+. In operation, however, this in-memory compiled formulas to greatly increase recalcu- 

does not simply involve fetching a value from Al, fetching lation speed. 

a value from B3, and then adding the two together, such as: Although memory addresses for cells are generally fixed 

once a spreadsheet has been loaded into memory, certain 
45 user operations may in fact change the memory addresses. 

FLD Al ;???-- not meaningful Memory addresses may be affected, far instance, when the 

Stttf vrvrtn ; ??? — not meaningful user deletes a spreadsheet row or column. In such an 

st,st(1) jwioogi instance, memory allocation for the cells of the spreadsheet 

typically changes, resulting in new (logical) memory 

Since a target coprocessor requires that values be loaded 50 addresses for the respective cells, 

from system memory, compiled machine language for the To correctly handle these situations, the system of the 

coprocessor requires v alid system memory addr esses (not present invention maintains a validity flag for indicating 

cell identifiers, such as Al and B3). Thus, bet ore spreadsheet whether an operation has occurred which would invalidate 

formulas may be compiled, an efficient mechanism for the compiled cell addresses (e.g., movement of a memory 

resolving cell identifiers as system memory addresses is 55 block). Operations which affect memory allocation include 

required. a call to invalidate the flag when an operation affecting cell 

Besides resolving addresses used in formulas at compile addresses occurs. On the next recalculation operation, new 

time, other problems must be addressed. For instance, Al compiled formulas are generated, using newly determined 

may store a number or a text string; or the cell may be empty cell addresses. At that point, the flag may be once again 

(blank), in which case storage for the cell has not even been 60 validated, for indicating availability of compiled formulas 

allocated. Moreover, the cell may store a special value, such for use with subsequent recalculation operations. As a prac- 

as ERR or NA, which requires special propagation to tical matter, operations which affect memory allocation 

dependent cells. In addition to managing data types of (e.g., adding and deleting rows) typically occur during the 

operands, a system must monitor for stack overflow and design phase of a particular spreadsheet model. Once that 

underflow. All told, a spreadsheet formula, such as A1+B3, 65 spreadsheet has been deployed to an end-user for use, e.g., 

cannot simply be "compfled"directly into machine code. as a tax template, user operations which would affect 

Instead, a system must first overcome a number of problems. memory allocation are typically infrequent 
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3. ERR and NA Propagation 

When evaluating formulas, a conventional floating-point 
formula interpreter must perform a number of error checks. 
Specifically, such an interpreter must correctly handle 
instances of spreadsheet error (ERR), such as divide-by- 
zero, or instances of a value being not available (NA), such 
as when a dependent spreadsheet is not loaded. Incorporat- 
ing numerous checks in compiled formulas, however, would 
significantly degrade performance, perhaps making formula 
compilation impractical. Accordingly, improved checking 
methodology is required for compiled formulas. 

According to the present invention, operation of the 
coprocessor is adapted to accommodate ERR and NA 
values, ensuring their proper handling and propagation. 
Industry-standard rules exist for propagating ERR and NA in 
electronic spreadsheets. ERR combined with a number is 
defined to evaluate to ERR (e.g., ERR+5=ERR). Similarly, 
NA combined with a number is defined to evaluate to NA 
(e.g., NA+5=NA). Finally, ERR combined with a NA must 
result in ERR (e.g., ERR+NA=ERR). 

As previously described, the Intel floating-point proces- 
sors include formats for positive and negative infinity and 
NaN ("Not a Number"). Consider storage of a real number, 
such as 1.2 e 20 (1.2 xlO 20 ). When the number is stored (in 
IEEE double format), it is converted into a specific binary 
format which includes 1 bit for the sign, 11 bits for the 
exponent and 52 bits for the mantissa. Recall further that the 
largest possible exponent is reserved for indicating infinity 
and NaNs. The bit sign still serves to indicate +°°or -*» (for 
positive or negative infinity). The mantissa bits indicate 
infinity when all are set to zero, or indicate NaN when at 
least one non-zero bit is present 

The system of the present invention employs these for- 
mats for representing spreadsheet ERR and NA values. 
Specifically, spreadsheet ERR and NA values are stored as 
(IEEE format) NaN values; ERR and NA may be distin- 
guishable from one another via the mantissa bits (e.g., one 
particular mantissa bit pattern for ERR, another for NA). In 
a spreadsheet, for instance, adding a cell with a value of 2.1 
to a cell with an ERR should yield a result or ERR. By 
encoding the ERR as a NaN, the coprocessor adds 2.1 to a 
NaN. thereby yielding a NaN (i.e.. ERR). Since adding a 
NaN (e.g., representing an ERR or a NA) to a non-NaN 
yields a NaN with an unchanged mantissa bit pattern (Le., 
preserving it as an ERR or NA), the floating-point proces- 
sor* s rules for propagating NaNs may be employed for 
correctly propagating ERR and NA values. 

Although an operation which combines a NaN with a 
non-NaN will propagate the NaN correctly (i.e., will propa- 
gate the NaN mantissa bit pattern without regard to the 50 
non-ERR operand), an operation may on occasion require 
the combination of two NaNs (e.g., adding a cell with NA to 
a cell with ERR). Standard spreadsheet rules for propagating 
ERR and NA require that ERR combined with NA yields 
ERR. The coprocessor rules for combining two NaNs may 
be employed to effect this result. In operation, the 
coprocessor, when required to combine two NaNs, sets the 
mantissa bins of the resultant to that of the NaN with the 
larger mantissa. By defining ERR to have a larger mantissa 
bit pattern than NA, the on-chip logic of the coprocessor 
may be employed to correctly propagate spreadsheet ERR 
over NA. In this fashion, the NaN propagating rules of the 
processor chip automatically ensure that ERR and NA 
values are correctly propagated, thus eliminating the need 
for the spreadsheet system to maintain special logic (e.g., in 
a compiled formula) for correctly propagating spreadsheet 
ERR and NA values. 
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4. ERR and NA Examples 

Referring now to FIGS. 4A-E, processing of NA and 
ERR for the spreadsheet of FIG. 2A will be illustrated. FIG. 
4A shows spreadsheet 410a having the value NA entered 
into cell B3 (by the user entering @NA — a spreadsheet 
function whose sole purpose is to return the value NA). As 
shown, the value NA in cell B3 propagates to cell B6, which 
stores the formula @SUM(B3.JB5). Similarly, the NA value 
of cell B3 propagates to cell H3, which stores a formula of 
@SUM(B3..G3). The NA value in cell H3 further propa- 
gates to cell H6; the cell H6 stores a formula of @SUM 
(H3..H5). 

In FIG. 4B, further modification is made to the spread- 
sheet (now, spreadsheet 410b) to include an ERR value. 
Specifically, the user has entered 1/0 (i.e., 1 divided by 0) in 
cell D3. As a result, cell D3 displays the value ERR. Cell D6 
which stores a direct reference to cell 03 (Le., @SUM 
(D3..D5)) also displays ERR. Similarly, the ERR value 
expressed in cell D3 also propagates to cell H3, which stores 
a formula referencing the cell D3 (as previously described). 
Note that the cell H3 includes references to both a cell 
storing NA (i.e., cell B3) and a cell storing ERR (i.e., D3). 
According to spreadsheet propagation rules (as described 
above), ERR propagates over NA. Finally, the ERR value of 
cell D3 propagates indirectly to cell H6, since H6 stores a 
reference to cell H3 (which in turn references cell D3. as 
previously described). 

FIG. 4C shows in detail the bit pattern layout for encoding 
NA and ERR as a "quiet" (non-signalling) NaN in both 
80-bit IEEE temporary real (TReal) and 64-bit tkkr double 
formats. As previously described, the exponent bits of a NaN 
are set high (i.e., equal to 1). Also required for the NaN (to 
distinguish it from infinity) is a non-zero mantissa. Thus, the 
most significant bit of the TReal mantissa (bit 63) is set high 
(for the double, it is implied to be 1). 

For indicating whether a NaN value is ERR or NA, the 
system of the present invention encodes information in bits 
60-62 of the TReaL After mapping the TReal into a 64-bit 
double, ERR and NA are represented as follows: 
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(1) 


NA without encoded address: 






0111 1111 1111 1010 


+ "Don't care" 




7 F F A 




(2) 


NA with encoded address: 






oai mi nil ion 


+ Encoded address 




7 F F B 




0) 


ERR without encoded address: 






oin mi mi noo 


+ "Don't care" 




7 F F C 






ERR with eroded address: 






oin mi mi noi 


+ Encoded address 




7 F F D 





Thus the fourth nibble (4 bits) of the 64-bit double includes 
a 4 trigger bit" for distinguishing NA (A or B) from ERR (C 
or D). Also, the nibble of ERR may be set to an E or F to 
indicate a more severe form of ERR (e.g., divide by zero). 
NA and ERR NaNs without addresses above (e.g., fourth 
nibble equal to A or C) are provided for compatibility with 
legacy systems (e.g., 1-2-3). 

The address of the cell where the ERR or NA originated 
is encoded in particular bits of the mantissa. As shown, 48 
bits are used in a preferred embodiment for encoding cell 
addresses; this is the 6-byte cell identifier previously 
described. 

Propagation of the NA value in spreadsheet 410a and 
propagation of the ERR value of spreadsheet 410f> within a 
coprocessor are illustrated in FIGS. 4I>-E. FIG. 4D illus- 
trates the operations which occur in the coprocessor during 
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evaluation of the formula for cell B6, @SUM(B3.,B5). First, values are added, thus leaving the [ERR, D3] NaN at the top 

the value for cell B3 is pushed onto the coprocessor stack of the stack (as the immediate result). At step 8, the value for 

(i.e., loaded into the ST register). Specifically, the ST cell F3, 3323, is pushed onto the stack. At step 9, the two 

register of the coprocessor is loaded with a quiet NaN having values are added, thus leaving the [ERR, D3]NaN at the top 

a mantissa encoding NA and the address of B3. As shown, 5 0 f the stack. At step 10, the value for cell G3, 4054, is pushed 

bits 451 are set to indicate NA (i.e. ? lower than ERR). Bits onto ^ stack> At step Uw &e ^ ^ values ^ mus 

453, on the other hand, encode the address of cell B3. Next, leaving ±e [E RR, D3]NaN at the top of the stack-the final 

the value stored in cell B4 900, is pushed onto the register result for ^ formula 0 f cell H3. As shown at step 12, the 

stackat step 2 At step 3 the two values ; are adcled together, yaluc be copied/moved (stQr ed/popped) into an appro- 

thus leaving the NaN which encodes NA and cell B3 on the 10 . te system memory location 

top of the stack (ie., as the immediate result). Next the value P v ry n. 

stored in cell B5, 750, is pushed onto the stack at step 4. At 5 - Use of Embedded Address for Auditing 

step 5, the two values are added together which again results A spreadsheet of even modest complexity contains many 

in the NaN encoding NA and cell B3 on the top of the stack. formulas, resulting in numerous intercell dependencies. An 

Thus the result of the formula for cell B6 is a NaN encoding 15 error occurring in any one cell may be difficult for the user 

NA and the cell address of B3. to track down and eliminate. Specifically, since the cell may 

FIG. 4E illustrates the specific operation of the coproces- in turn depend from a multitude of cells, and those cells may 

sorf or evaluating the formula of cell D6 in spreadsheet 410ft i n turn reference a multitude of other cells, the user would 

(of HG. 4B). As the first step, a value for ERR is pushed & a conventional system have to check the formula of each 

onto the coprocessor stack- This is done by pushing a quiet 20 ceU from which the cell directly or indirectly depends. All 

NaN encoding ERR and the cell address D3. Specifically, to i d , eliminating errors in a spreadsheet of even modest 

mantissa bits 461 are set high, for distinguishing this NaN complexity may be a daunting task for the user, 

from a NaN encoding NA^ The address bits 463 encode the FIGS. 6A-B illustrate use of the cell address encoded in 

i.^ SS ^ t~^ e C ^. whe ^f ^ ™ a V ^ UC ate ^ NA and ERR NaNs for quickly auditing the source of errors 

£** 2 l^^^ m ^^^ pM Tl 25 ^ a spreadsheet naLiUustrates selection of an Audit 

te register stadc A step 3 the two values are added ^ tool 610 fQr ^ ^hett 410 (now, spreadsheet 

together, thus leading to the NaN encoding ERR «id odl D3 4Mc) ^ tool ^ ^ nted tQ ±Q ^ JgZriri 

at the top of the stack (as the ^nnmediate .result) At step 4, ^ usef ^ Methodology, as shown. U?on selec- 

tiie value stored in cell D5 891 is loaded, and a step 5 the ^ tod ^ ^ & ~ eadsheet 410 ^ updated 

two values are added together. As before, a NaN combined 30 ac cWl , n . ™ *~ ^^^41^ 

... , T XT . ,j t , vt xt *t_ 1 ^ vt xt as shown in Flu. 6B (now, spreadsheet 410a). Trie spread- 

with a non-NaN yields the NaN, thus leaving the NaN * * ... A , v ' f\ , „ . , 

a . . " * / „ " ™r \ \ «T • 1 « sneet now displays the embedded cell address in those cells 

encoding ERR and cell D3 at the top of the stack. Hence, trt „, • , . f f, ' fM ™ r c m . a: ^„, aA . 

,. -° , , r ii r.L jl t0 which it propagates. Thus for the error value displayed in 

evaluation of the formula for cell D6 of the spreadsheet 410ft „ pll tl T ,J1 * Q „ wo/ , ;t „ A i^^ „ •„„ ' «. 

yields a NaN having ERR and the cell address of D3. ^ ,u ' u T y V ^ ^ g * ^ 

j wu« ai^uiivuiBciuvauuuic „ A A \ cell that the error value is a result of an error occurring in cell 

The intaplay between Na^ encodmg NA and those 35 D3 . Note, however, mat me ceUH6 does not depend oJrectly 

™* 15 " ed t "[ ™»; 5A-B RecaU that the from ^ ^ m ^ cdl R6 d ^ n6 ^ ctly from 

cell H3 of the spreadsheet 4106 ^ stores a formula which ceU H3 which, in turn, depends directly from cell D3 Thus, 

2S*V mrT/n 0 ,1 "* J"? ^Tl" me propagation of me address of the originating ceU to aU 

simplification, HG. 5A-B i w* represent values stored in cell5 , regardless of how many intervening cells 

coprocessor renters m shor^and format such as "[NA, 40 ^ ^ formulas ^ „ ^ ^ ^ 

U g^ resen ^ a encodin g 411(1 me for and the cell which is the source of the error, allows the 

06 As the first step, the value for cdl B3 is pushed onto the 

. , A . *?. . XT VT *. XT . JiU The user may immediately proceed to the source of the error, 

stack. As shown, this results in a NaN encoding NA and the w { r u rtllt ^,JLh;™ q «„ t w 11( ,k o ^^i^^^ ^ 

ad^ofcenBSbempto^in.heSrxegisL.Nextthe 45 ^^^17 8 ^ 3 ° f 

value stored in cell C3 is pushed onto the stack, as shown at f 

step 2. At step 3, the two values are added together, thus (^Compiling formulas 

leaving the [NA, B3] NaN at the top of the stack (as the Compiling formulas from their RPN token interpreter 

immediate result). At step 4, the value for cell D3 is pushed format to executable machine code occurs in several steps, 

onto the stack, thus resulting in a NaN encoding ERR and 50 ^ particular, several conditions should be met during com- 

the address of D3 being stored in the ST register. At step 5, pilation for the compiler of the Evaluator to generate a 

the two top values (i.e., in ST and ST(1)) are added together, compiled formula. In the currently preferred embodiment, 

this operation results in the [ERR, D3] NaN at the top of the me compiler may attempt to compile all formulas, but 

stack (as the immediate result). It is important to note at mis generally it will not compile formulas with complicated 

step that the [NA. B3] NaN combined with the [ERR, D3] 55 ©-functions or transcendental functions. 

NaN yields a result of [ERR, D3] NaN. In other words, the In an exemplary embodiment the formula compiling 

ERR value of cell D3 propagated over the NA value of cell method is performed by scanning the tokens of the formula 

B3. in two passes. In a first pass, the tokens are checked for 

Continuing with the example, at step 6 the value for cell "compilability" (Le., ability to be compiled into 80x86780x 

E3, 2724, is pushed onto the stack. At step 7 the two top 87 instructions). Also at this pass, a reference list is built 



Formula Compiling: 
// fiist pass: 

// scan token, checking token compilability and building reference list 
while not end of formula 
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-continued 



check token type far comp liability 
if token is an operand 
if operand is constant, store value in reference list, 

otherwise 

locate cell reference and attempt to generate pointer to cell 
pointer 

if cell pointer/body is not allocated then fail 
otherwise store pointer in reference table 
end while 



As shown, the reference list is built by storing a value in the 
list for each constant and storing a pointer in the list for each 
cell reference. If a cell being referenced is not available (Le., 
pointer to cell cannot be generated), then the attempt to 
compile the formula is abandoned (i.e., the first pass fails). 

In the second pass, the compiler generates the actual 
compiled formula for the formula by scanning the tokens 
and emitting a machine language sequence (for the target 
processor/co-processor). 



corresponding compiled formula. Compiled formulas for a 
spreadsheet are stored in one or more code segments (e.g.. 
Code Segment 720, 730). The upper-size limit of any one 
particular code segment depends on system implementation; 
in an Intel 80x86 segmented architecture, for instance, 
segments are limited to no larger than 64K. For the cell 
record 710. ^CF X points to Compiled Formula^ which 
resides in Code Segment 720. In the event that the pointer 
to the compiled formula is set to NULL, the Formula 



// second pass: generate code for each token type 
while not end of formula 

switch on token type to code generation subroutine for the class 
of token found 
for push constant value generate push offset to value of the 
correct data type 

for push argument tokens generate onset and call linkage to 
push value tunc. 

for simple operators, generate appropriate floating point 
instruction 

for complex operators, generate several floating point 
instructions 

for simple ©functions, generate code sequence for the CPU and 
FPU that 

implement the function. Also use calls to push value 
functions. 

end while 



As shown, the method proceeds based on the type of token. 
For a constant value (e.g., 5 in Al +5), an offset to the value 
(i.e., an offset into the reference list) is generated. For an 
argument token (Le., argument to a function), an offset is 
generated as well as a call linkage to the corresponding 
function. When a simple operator is encountered (e.g., **+"), 
an appropriate floating point instruction is generated (e.g., 
FADD for "+"). For a more complex operator, a series of 
instructions implementing the operator are generated. Far 
©-functions, a code sequence that implements the function 
is generated for the target processor/coprocessor. At the 
completion of the two-pass method, a compiled formula has 
been generated; this is stored in system memory at a location 
pointed to by the cell's pointer-to-compiled formula (pCF). 
7. Compiled formula storage and execution 

In addition to compiling a formula into a native machine 
language sequence, the present invention provides methods 
for compiled formula storage and execution. As shown in 
FIG. 7, the compiled formulas are stored in one or more code 
segments (e.g., segments 720, 730). The specific in-memory 
relation between cell records and compiled code formulas, 
as well as the internal organization of a compiled formula, 
will now be described. 

FIG. 7 shows the in-memory layout of a cell record 710. 
The cell record includes, among other things, a pointcr-to- 
compiled formula (pCF), a formula result (64-bit trrr 
double), and a formula array (storing the formula p-code). 
Hie pointer to the compiled formula for the cell record 
points to an offset in a code segment which stores the 



Evaluator will fallback to standard interpreter-based formula 
evaluation (i.e., evaluating the p-code sequence stored in the 
cell record's formula array with a runtime interpreter). The 
pointer is set to NULL when a formula cannot be compiled. 

As shown, each code segment includes a copy of Helper 
Functions 705. The Helper Functions 70S are useful routines 
for assisting with the execution of compiled formulas. In an 
exemplary embodiment, one using Intel segmented 
architecture, the Helper Functions are stored in the same 
segment — that is. as a "near" library. In this manner, a 
compiled formula may invoke Helper Functions in its code 
segment as a "near" call (thus resulting in faster execution). 

The layout for the Compiled Formula is shown in detail 
by memory block 750. In particular, the Compiled Formula 
Memory Block 750 comprises a first section for storing data 
and a second section for storing compiled (machine 
language) code. The data section typically stores the refer- 
ence list as a table comprising pointers to cells which have 
been referenced within the formula. After the cell lookups, 
the data section stores any constant values (e.g., the value 
900 in the formula +C3*F4/900). Finally, the data section 
stores a pointer which points back to the formula result 
(64-bit IEEE double) in the corresponding in-memory cell 
record; use of this pointer is described below. 

Following the data section in the compiled formula is the 
compiled code proper. This comprises the literal native 
machine language sequence which is to be executed by the 
target processor/coprocessor. The pointer-to-compiled for- 
mula (pCF) points here. The compiled code itself stores 
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memory-address operands which point back to the cell fmult sttixst 
addresses and/or constants stored in the data section. Hie 

compile code concludes by executing a jump instruction to This leaves the value of C3 * F4 on the top of the stack, 
a Helper Function in the near library which stores the result 

ofmeCompUedFonnula and exits (i.e., returns control back 5 ^ compiled code routine next proceeds to load the 

to the Formula Evaluator). address of the literal value in the formula (900) in a 

FIG. 8 illustrates the construction and layout of a code processor register: 
segment storing compiled formulas in greater detail. At 

offset 0000 in the code segment, the near library 801 of mov bx,0269 

Helper Functions is stored. An exemplary embodiment of io 

to t Pr ^lo a ? US *- VALU ? and then loads the pointed-to value into the coprocessor 

and a STORE_AND JKTT function. The operation of each stacfc; r 

of these is described in further detail hereinbelow. 

Following the near library is the first compiled formula 

810. The exact layout of compiled formulas will be illus- 15 FrR I BX l 

trated using the example of +C3* F4/900 stored in the Cell 

Bl. Hie Compiled Formula (810) includes a data section 811 The value (Le,, 900 or 0269h) is now on top of the copro- 
and a compiled code section 813. The data section 811 stores cessor stack. 

preferences and/or values w ^ha^ in the formula. ^ a floati ^ ^ ision instruction be 
For the example at hand (l .e.. C3*F4/900), fte foUowmg 20 executed for dividm ^ C 3*F4 (stored on the stack) by the 

cells and/or data members are present: C3* F4, and 900. f © \ > j 

Accordingly, the data section 811 stores pointers to cells C3 v ue 0 
and F4, and stores the literal constant of 900. Finally, the 
data section 811 stores a pointer which points back to the FDIVP ST(l),ST 

formula result (64-bit double) of the cell record. 25 
The compiled code for the formula is stored in the The result of the formula is now on the top of the stack. 

coinpfledcodese^^ pointer-to-formula result into a 

the pointer-to-compiled formula (pCF) for the cell Bl points 7 "~ ^ 

to here. An exemplary machine language sequence for the P rocessor regis . 

formula of the present example is as follows. First, a pointer 30 

to C3's record is loaded into a processor register: M0V BX£26B 

movbx,026i and executes a jump to the STORE_AND_EXJT near 

THis serves as a parameter to the PUSH_VALUE function. 35 faaak ^ for St0± * rcsult back 0Ut t0 me ceU 
Next, the routine calls (near call) to the PUSH„ VALUE 

Helper Function: **** STORE_AND_Exrr 

CALLPUSH_value The STORE_AND_JEXTT function serves to store the 

t i ... . nTTP „ wat ttt7 ^ ^- An result back out to the cell record, performing any necessary 

In an exemplary embodiment, the PUSH_VALUE function 40 , & ' J 

operates^u*thevaIueforthece]l(h e re ) ceUC3)ontothe TkTk a ^ oaya,u f to ERR). This may be accom- 

topof ^coprocessor's stack. The PUSH_VALUE function the i^wmg steps (again shown in pseudocode 

may accomplish this by the following steps (shown in tonnat): 

pseudocode format): Procedure: Store and Exit 



Procedure: Push Value 

load pointer to cell body of value referenced 
check the data type of the cell 

for a blank cell push 0 

for a string value push 0 

for an integer push it on the floating point units floating point 
number stack 

for a formula result or fractional double value push the double 



55 

At the conclusion of the PUSH_VALUE routine, an appro- 
priate value for the cell (C3) has been pushed on the top of 
the coprocessor's stack 

The compiled code routine proceeds in a similar manner 
to load the value of the F4 cell (by loading its pointer and 60 
pushing the pointed to value): 

MOVBX.0265 
CALL PUSH_VALHQE 

Now, values for C3 and F4 have been stored on the stack. 65 
Next, the two top coprocessor registers are multiplied 
together, thus multiplying the value of C3 by F4: 
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load painter to formulas value from table using offset provided as a 
parameter 

store value from floating point unit as a double 
check floating point unit status ton 

overflow, divide by zero and invalid operation 

if no error detected, then exit 

else assign a code and copy the cell coordinate to the matissa 
area of the double 

for all errors assign 0 to the sign bit, 07FFh to the exponent 

and: 

for invalid operation assign GFh to the MSB's of the 
mantissa value 

for overflow assign OEh to the MSB's of the mantissa value 
for divide by zero assign ODh to the MSB's of the mantissa 
value 

and copy the formula cell coordinate to the LSB's of the 
mantissa 

finally clear the exception state in the floating point unit 



At the conclusion of the routine an appropriate result for the 
formula has been stored back out to the cell record and 
control has returned to the Formula Evaluator, which may 
then proceed to evaluation of the next (compiled or 
uncompiled) formula. 

Attached hereto is an Appendix containing ASM source 
code listings providing a description of Helper Functions of 
the present invention, suitable for use in a general purpose 
digital computer system such as an IBM-compatible per- 
sonal computer with an Intel 80x87-class FPU. A suitable 
assembler for compiling the code is available from several 
vendors, including Borland International of Scotts Valley, 
Calif. 



20 While the invention is described in some detail with 
specific reference to a single preferred embodiment and 
certain alternatives, there is no intent to limit the invention 
to that particular embodiment or those specific alternatives. 
Although an exemplary embodiment using an Intel 80x87- 

25 class FPU is described for instance, those skilled in the art 
will appreciate that other IEEE-compliant FPUs may be 
employed. Thus, the true scope of the present invention is 
not limited to any one of the foregoing exemplary embodi- 
ments but is instead defined by the appended claims. 



APPENDIX: Helper Functions for Intel Architecture 



; Helper Functions stored as near library 
; Copyright €> 1994 Borland International 
PUSHVALUE: LES 
TEST 
JE 
LES 
TEST 
JE 

CMP 
JE 
FLD 
RET 

PVl: FLDZ 
RET 

PVZ: FILD 
RET 

STORE_AND_EXrT: 

LES 

FSTP 

FNSTSW 

AND 

JNE 

RETF 

SE^EXCEPT: MOV 
TEST 
JNE 
DEC 
TEST 
JNE 
DEC 

SE_EX1: CALL 
WATT 
FNCLEX 
RETF 

SE__WRrrE_EXCEPT: 

MOV 
MOV 
MOV 



BX, CS: (BX] ; load segment offset 

ES:BYIEPTR[BX3,01 

PVl ; loadO 

BX£S:[BX] 

ES:BYTE PTR[BX-01], 06 
PVZ 

ES:WORD PTR[BX407], 7FF8 
PVZ 

ES:QWORD PTR[BX+01] 



ES:WORD FTR[BX+02] 



BX, CS:[BX] ; load ptr to formula result 
E&QWORD PTRfBXl ; pop result 
AX ; dump status word 
AL, ODh ; check bits 0, 2, 3 
SE__EXCEPT 

DX, 7FFFh ; ERR bit pattern 

AL, Olh ; Test status word for exceptions 

SE_EX1 

DL 

AL,08h 
SE—EXl 
DL 

SE_WRnE_JEXCEPT 



; Write special values to formula double 
EAX, ES:[BX408J 
ES:[BX], EAX 
AX, ES:[BX+OCh] 
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-continued 
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APPENDIX: Helper Functions for Intel Architecture 



MOV ES:[BX+04], AX 

MOV ES:[BX+06], DX ; Note DX value 

RET 



What is claimed is: 

1. In an electronic spreadsheet system having an IEEE- 
compliant floating-point unit said floating-point unit having 
processor registers comprising sign, exponent, and mantissa 
bit fields, each field being arranged from a most-significant 
bit to a least-significant bit, a method for storing in the 
floating-point unit spreadsheet values representing industry- 
standard spreadsheet error (ERR) and not available (NA) 
values, the method comprising: 

(a) reserving at least one bit of the mantissa field for 
distinguishing a spreadsheet value representing ERR 
from one representing NA, wherein said at least one bit 
is set high if the spreadsheet value represents ERR; and 

(b) storing each spreadsheet value representing ERR and 
NA as an IEEE-compliant Not-a-Number (NAN) value 
in a register of the floating-point unit by: 

(i) setting all bits in the exponent field of the register to 
high, 

(ii) setting the most-significant bit of the mantissa field 
to high, and 

(iii) setting said at least one bit to high if the spread- 
sheet value represents ERR. 

2. The method of claim 1, wherein the floating-point unit 
is constructed so that arithmetic operations of the floating- 
point unit which combine a NaN value with a non-NaN 
value result in a value identical to the NaN value, the method 
further comprising: 

storing a first particular spreadsheet NaN value in the 
floating-point unit;. 

storing a second particular spreadsheet non-NaN value in 
the floating-point unit; and 

combining the first and second particular spreadsheet 
values in the floating-point unit whereby the construc- 
tion of the floating-point unit provides a result having 
a value identical to the NaN value. 

3. The method of claim 1, wherein the .floating-point unit 
has a construction where arithmetic operations of the 
floating-point unit which combine a first NaN value with a 
second NaN value result in a value identical to the NaN 
value having the larger mantissa value, the method further 
comprising: 

storing a first particular spreadsheet NaN value represent- 
ing ERR in the floating-point unit; 

storing a second particular spreadsheet NaN value repre- 
senting NA in the floating-point unit; and 

combining the first and second particular spreadsheet 
values in the floating-point unit, whereby the construc- 
tion of the floating-point unit provides a result having 
a value identical to the NaN value representing ERR 
due to the setting of said at least one bit to high in step 
(b) (Hi). 



4. The method of claim 1, further comprising: reserving a 
10 plurality of bits in the mantissa field for encoding a cell 

identifier for indicating a particular cell of the spreadsheet 

where the spreadsheet value originated, so that said cell 

identifier propagates to dependent cells. 
i5 5. The method of claim 4, further comprising: indicating 

to a user originating cells for spreadsheet ERR and NA 

values by: 

(i) decoding all encoded cell identifiers into cell 
addresses; and 

2Q (ii) displaying in dependent cells the cell addresses for 
cell identifiers which have propagated to dependent 
cells. 

6. In an electronic spreadsheet system having an IEEE- 
compliant floating-point unit, said floating-point unit having 

25 processor registers comprising sign, exponent and mantissa 
bit fields, each field being arranged from a most-significant 
bit to a least-significant bit a method for staring in the 
floating-point unit spreadsheet values representing Not-a- 
Number (NAN) values, the method comprising: 

30 

(a) reserving a plurality of bits in the mantissa field for 
encoding a cell identifier for indicating a particular cell 
of the spreadsheet where the spreadsheet value 
originated, so that said cell identifier propagates to 

35 dependent cells; 

(b) for a spreadsheet cell giving rise to an NaN value, 
representing the NaN value in a register of the floating- 
point unit by: 

(i) setting all bits in the exponent field of the register to 

40 high* 

(ii) setting at least one bit of the mantissa field in the 
register to high, and 

(iii) setting said plurality of bits in the mantissa field of 
the register to encode a cell identifier for the par- 

45 ticular cell of the spreadsheet where the NaN spread- 

sheet value originated; 
whereby, when the spreadsheet cell representing the NaN is 
combined with a non-NaN value, the result is the NaN value 
with the cell identifier, so encoded, thereby propagating the 

50 cell identifier giving rise to the NaN value to a dependent 
cell. 

7. The method of claim 6, further comprising indicating to 
a user originating cells for spreadsheet NaN values by: 

55 (i) decoding all encoded cell identifiers in NaN cells into 
cell addresses; and 
(ii) displaying in dependent cells the cell addresses for 
cell identifiers which have propagated to dependent 
cell. 

* * * * * 
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